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Widespread intragenic transcription initiation has been 
observed in many species. Here we show that the 
Escherichia coli ehxCABD operon contains numerous 
intragenic promoters in both sense and antisense ori- 
entations. Transcription from these promoters is si- 
lenced by the histone-like nucleoid structuring (H-NS) 
protein. On a genome-wide scale, we show that 46% of 
H-NS-suppressed transcripts in E. coli are intragenic in 
origin. Furthermore, many intergenic promoters re- 
pressed by H-NS are for noncoding RNAs (ncRNAs). 
Thus, a major overlooked function of H-NS is to prevent 
transcription of spurious RNA. Our data provide a molec- 
ular description for the toxicity of horizontally acquired 
DNA and explain how this is counteracted by H-NS. 
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Pervasive transcription is a phenomenon ubiquitous across 
the three domains of life (Dornenburg et al. 2010). It is 
defined by an undercurrent of noncanonical transcripts 
within the primary transcriptome (Clark et al. 201 1 ). In 
bacteria, such transcripts arise from normal promoter 
elements but are conspicuous because they are non- 
coding, not delineated by gene boundaries, and some- 
times antisense (Peters et al. 2012). Sites of pervasive 
transcription are poorly conserved. Hence, even be- 
tween closely related bacteria, patterns of pervasive 
transcription differ (Raghavan et al. 2012). This has led 
to speculation that pervasive transcription is a conse- 
quence of the evolution of protein-coding genes (Peters 
et al. 2012; Raghavan et al. 2012). It is likely that 
pervasive transcription is suppressed, as spurious RNA 
synthesis is potentially catastrophic. Synthesis, stability, 
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and translation of mRNA may be perturbed (Dornenburg 
et al. 2010), and formation of R loops (DNA:RNA hybrids) 
can destabilize genome integrity (Gowrishankar and 
Harinarayanan 2004). 

In Escherichia coli, there are two major sources of 
pervasive transcription: inefficient termination of RNA 
synthesis at gene boundaries and spurious transcription 
initiation in genes (Dornenburg et al. 2010; Peters et al. 
2012). Attention has turned to elucidating mechanisms 
that suppress these phenomena. As a first step, Peters 
et al. (2012) identified Rho and the cof actor NusG as 
suppressors of pervasive antisense transcription. In their 
model, Rho acts via two pathways. First, Rho ensures 
termination of transcripts at gene boundaries, preventing 
"leaky" transcription of downstream genes. Second, Rho 
terminates antisense transcripts arising from promoters 
in genes. Thus, on a global scale, noncanonical tran- 
scripts are targeted for termination. However, since 
most transcription is regulated at the level of initiation 
(Browning and Busby 2004), additional mechanisms to 
suppress pervasive transcription are likely. 

The histone-like nucleoid structuring (H-NS) protein 
is a DNA folding factor. It is abundant in bacteria and 
influences transcription (Dillon and Dorman 2010). Nu- 
cleoprotein containing H-NS is structurally diverse,- 
H-NS can form bridged or filamentous DNA complexes 
(Dame et al. 2006; Liu et al. 2010). Two mechanisms for 
transcriptional repression by H-NS have been proposed: 
(1) exclusion, in which H-NS masks RNA polymerase 
(RNAP)-binding sites (Lim et al. 2012), and (2) trapping, in 
which H-NS ensnares initiating RNAP in DNA loops 
(Dame et al. 2002). H-NS may also slow elongation, leading 
to increased Rho-dependent termination (Peters et al. 2012). 
Chromatin immunoprecipitation (ChIP) has been used 
to map binding of H-NS across bacterial chromosomes 
(Grainger et al. 2006; Lucchini et al. 2006; Navarre et al. 
2006; Oshima et al. 2006; Kahramanoglou et al. 2011). 
These studies show that H-NS targets A/T-rich DNA, a 
distinguishing feature of horizontally acquired genes. 
Consequently, H-NS has been proposed to facilitate 
genome evolution by silencing transcription of foreign 
DNA (Navarre et al. 2006). A central tenet of this model is 
that foreign DNA is toxic. However, a molecular expla- 
nation for this toxicity and how H-NS counteracts this is 
lacking. 

Here, we describe widespread suppression of intragenic 
transcription initiation in bacteria. We show that this is 
a major function of H-NS. First, we show that H-NS 
represses transcription from many intragenic promoters 
within the A/T-rich ehxCABD operon of E. coli 0157:H7. 
Second, we show that most H-NS-repressed promoters 
genome-wide in E. coli K-12 are for noncoding RNAs 
(ncRNAs), with many promoters located inside genes. 
Third, we show that suppression of widespread intra- 
genic transcription is a likely function of H-NS analogs 
in diverse bacterial species. 
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Results and Discussion 

The enterohemorrhagic E. coli 0157:H7 ehxCABD 
operon contains many intragenic promoters 

The ehxCABD operon is a well-characterized H-NS target 
(Li et al. 2008; Rogers et al. 2009; Iyoda et al. 2011). Our 
previous analysis of ehxCABD suggested the existence 
of intragenic promoters (Singh and Grainger 2013). We 
began here by scanning the operon for sequences that 
resemble promoter —10 elements. As a control, we applied 
the same analysis to the E. coli 0157:H7 and K-12 
genomes. For ehxCABD, our analysis identified 95 puta- 
tive intragenic —10 elements, an approximately fourfold 
enrichment over the genome averages for both E. coli 
strains. To test for function, we cloned the 95 sequences, 
each on a 62-base-pair (bp) DNA fragment, upstream of 
lacZ in the reporter plasmid pRW50 (Supplemental 
Fig. SI). In some cases, predicted -10 elements over- 
lapped. Hence, we were able to clone the 95 sequences 
on 83 DNA fragments. Twenty of the fragments stim- 
ulated p-galactosidase activity twofold or more above 
background (Fig. 1). We selected five of the 20 active 
constructs representing sense and antisense promoters 
with a wide range of activities (labeled a-e in Fig. 1) and 
examined the effects of disrupting the predicted —10 
element. In all cases, this resulted in a significant de- 
crease in p-galactosidase activity (Supplemental Fig. S2). 

Direct repression of intragenic promoters 
in the ehxCABD operon by H-NS 

Transcription of ehxCABD is well characterized as being 
repressed by H-NS (Li et al. 2008; Rogers et al. 2009; Iyoda 
et al. 2011). Despite this, H-NS is unable to block tran- 
scription from the canonical ehxCABD promoter (Singh 
and Grainger 2013). We hypothesized that H-NS acts via 
the intragenic ehxCABD promoters. To test this, we 
measured (B-galactosidase activity in wild- type and Ahns 
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Figure 1. The E. coli ehxCABD operon contains many intragenic 
promoters. (3-Galactosidase activity driven by short intragenic 
ehxCABD promoter fragments in strain JCB387. The top chart 
shows the activity of promoters that drive sense transcription, and 
the bottom chart shows the activity of promoters that drive 
antisense transcription. Each data point is aligned with the corre- 
sponding section of the ehxCABD operon. Data points labeled a-e 
correspond to promoters used in further experiments. The red bar 
represents the canonical promoter for ehxCABD. Background activ- 
ity (empty pRW50) is shown by a green line. 



cells containing different sections of ehxCABD fused to 
lacZ in plasmid pRW50. First, we examined a large, 5429- 
bp section of ehxCABD lacking the canonical upstream 
promoter. Transcription initiating in this coding DNA, 
which is repressed fivefold by H-NS, must be due to in- 
tragenic promoters (Fig. 2 A, panel i). Sequential removal of 
upstream sequences reduces repression by H-NS (Fig. 2 A, 
panel ii). Similarly, transcription from promoters a-e was 
unaffected by H-NS in the context of the 62-bp fragments 
(Fig. 2A, panel iii). We conclude that H-NS represses 
transcription from intragenic ehxCABD promoters, but 
repression requires extensive flanking DNA. Note that 
the sequence of the flanking DNA is also crucial because, 
when flanked by plasmid DNA, the 62-bp ehxCABD 
fragments were not subject to repression by H-NS. 

Given that H-NS repressed intragenic ehxCABD pro- 
moters only in the context of larger ehxCABD fragments, 
we predicted that H-NS would bind tightly throughout 
the operon but weakly to small fragments in pRW50. 
Consistent with this, H-NS forms bridged and filamen- 
tous complexes with DNA that are more stable than 
isolated H-NS-binding events (Bouffartigues et al. 2007). 
Hence, we measured binding of H-NS to promoters a-e 
in the context of the large (5429-bp) and short (62-bp) 
fragments of ehxCABD cloned in pRW50. ChIP was used 
to measure binding of H-NS to the different targets in 
vivo. Figure 2B shows PCR analysis of DNA immunopre- 
cipitated with anti-H-NS. Control experiments analyz- 
ing DNA recovered from mock immunoprecipitations are 
also shown. As predicted, loci a-e bind H-NS in the 
context of the large ehxCABD fragment (Fig. 2B, top 
panel). Conversely, H-NS did not bind the same loci when 
they were isolated from the rest of the operon (Fig. 2B, 
bottom panel). No enrichment of the control chromo- 
somal yabN locus, which is not bound by H-NS, was 
detected in any immunoprecipitate (Supplemental Fig. 
S3). In complementary experiments, we measured bind- 
ing of H-NS to the same DNA fragments in vitro using 
electrophoretic mobility shift assays (EMS As). H-NS 
bound to the 5429-bp ehxCABD fragment but not the 
short intragenic promoter fragments (Fig. 2C). 

To confirm direct H-NS-mediated repression of intra- 
genic ehxCABD promoters, we recapitulated the phenom- 
enon in vitro. Two DNA fragments were used as templates 
for in vitro transcription: a 743 5 -bp DNA fragment con- 
taining the entire ehxCABD operon and upstream region 
(Fig. 2D, lanes 3-9) and a 6435-bp fragment lacking the 
upstream region and canonical promoter (Fig. 2D, lanes 10- 
16). RNAP catalyzed the synthesis of multiple transcripts of 
varying size from each template. Consistent with a previous 
report (Singh and Grainger 2013), transcription from the 
canonical ehxCABD promoter was unaffected by the addi- 
tion of H-NS, even at high concentrations (7312-nucleo- 
tides [nt] RNA) (Fig. 2D, lanes 3-9). Conversely, synthesis 
of shorter transcripts, ranging from <100 nt to thousands 
of nucleotides in length, was dramatically reduced by 
increasing concentrations of H-NS (Fig. 2D, lanes 4-9,1 1- 
16). The similarity of transcript profiles synthesized from 
the two templates confirms that the short RNAs arise 
from internal initiation events and not from stalled 
transcription from the canonical promoter. 

Widespread repression of intragenic promoters by H-NS 

To determine whether H-NS suppresses intragenic tran- 
scription at other loci, we investigated the effect of H-NS 
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Figure 2. H-NS suppresses intragenic ehxCABD transcription. [A) H-NS suppresses intragenic ehxCABD transcription in vivo. |3-Galactosidase 
activity stimulated by ehxCABD fragments fused to lacZ in plasmid pRW50. Intragenic promoters a-e are labeled. (3-Galactosidase activity was 
measured in MG1655 and the Aims derivative. Background p-galactosidase activity, obtained with empty pRW50 vector, was subtracted from 
the values shown. Experiments were done in triplicate, and the standard deviation was always <10% of the mean. [B) H-NS binds promoters 
within the ehxCABD operon in vivo. ChlP-PCR was used to measure binding of H-NS at intragenic ehxCABD loci cloned in pRW50. PCR 
products were generated with primers to detect the five intragenic ehxCABD promoters, a-e. (C) H-NS binds promoters within the ehxCABD 
operon in vitro. Electrophoretic mobility shift assays with either small (62-bp) or large (5429-bp) fragments of ehxCABD. The different small DNA 
fragments are labeled a-e. H-NS was added at a concentration of 4 (jlM or 8 |iM. (D) H-NS suppresses intragenic ehxCABD transcription in vitro. A 
run-off in vitro transcription assay using DNA templates consisting of the whole ehxCABD operon with (7435 bp) and without (6435 bp) the 
upstream regulatory region. The gel was calibrated with a sequencing reaction (lane 1 ) and the radiolabeled DNA template (lane 2). RNAP was used 
at a concentration of 200 nM (lanes 3-16), and H-NS was added at concentrations of 150, 300, 600, 750, 1500, and 3000 nM (lanes 4-9,11-16). 



on genome-wide transcription initiation in E. coli K-12. 
We chose E. coli K-12 because (1) H-NS has been studied 
most extensively in this strain, (2) H-NS in E. coli K-12 is 
100% identical to that in E. coli 0157:H7, and (3) high- 
resolution genome-wide binding data for H-NS, generated 
using ChIP sequencing (ChlP-seq), are available. As a 
starting point, we determined association of a 70 with the 
E. coli K-12 genome in wild- type and Ahns cells using 
ChlP-seq. Thus, we identified the location of promoter- 
bound RNAP and the level of occupancy (Supplemental 
Table SI). In parallel, we identified transcription start 
sites (TSSs) genome-wide in wild-type and Ahns cells 
using a modified RNA sequencing (RNA-seq) approach 
(Singh and Wade 2014). This method also determines the 
relative abundance of RNAs associated with each TSS 
("TSS strength"). Together, these data identify sites of 
transcription initiation with high confidence and nucle- 
otide resolution. Visual inspection of the data revealed 
many promoters repressed by H-NS (Fig. 3A). These 
promoters typically correspond to regions of high H-NS 
binding (Kahramanoglou et al. 2011). Repression was 



reflected by increases in both a 70 occupancy and TSS 
strength in the Ahns mutant. Many of the H-NS -repressed 
promoters are located in genes. 

We selected a 70 ChIP -seq peaks that collocated with 
TSSs in the Ahns mutant. Thus, we identified 668 pro- 
moters with associated information for a 70 occupancy, 
TSS strength, and TSS orientation (Supplemental Table 
SI). We divided the promoters into "low H-NS" (508 
promoters) and "high H-NS" (160 promoters) categories 
based on the level of H-NS binding. For each promoter, a 70 
binding or TSS strength was compared in Ahns and wild- 
type cells. The result of the comparison is in Figure 3, B (for 
cr binding) and C (for TSS strength). In the scatter plots, 
each data point represents one of the 668 promoters. 
Consistent with H-NS-mediated repression, association 
of a 70 with promoters in high H-NS regions was signifi- 
cantly higher in Ahns cells than in wild- type cells (Mann 
Whitney U test, normal approximation, P = 1.4 X 10~ 23 ) 
(Fig. 3B). Similarly, for high H-NS regions, TSS strength 
was significantly higher in Ahns cells (Mann Whitney U 
test, normal approximation, P = 2.7 X 10~ 10 ) (Fig. 3C). In 
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contrast, for low H-NS regions, neither cr 70 binding nor 
TSS strength was significantly different in Ahns and wild- 
type cells (Mann Whitney U test, normal approximation, 
P = 0.29 for a 70 occupancy and P = 0.23 for TSS strength) 
(Fig. 3B,C). Data for all 668 promoters are summarized in 
Figure 3, D (a 70 binding) and E (TSS strength). As 
expected, the mean wild- type/ Ahns ratio for both a 70 
binding and TSS strength is close to 1.0 in low H-NS 
regions (Fig. 3D,E, blue box). Conversely, the equivalent 
ratio is 0.2 or 0.3 for high H-NS regions (Figs. 3D,E, first 
red box). Taken together, our data indicate that high 
H-NS occupancy reduces the binding of initiating RNAP 
and represses transcription, consistent with previous 
studies (Dillon and Dorman 2010). 

We next examined the position of the 668 promoters 
with respect to genes. Promoters in high H-NS regions are 
far more likely to be inside genes (46% of promoters) than 
promoters in low H-NS regions ( 19% of promoters,- Fisher's 
exact test, P = 2.0 X 10 -11 ). However, an intragenic or 
intergenic location for a promoter does not fully describe 
its genomic context. For example, an intergenic promoter 
may be orientated opposite to surrounding genes and 
unable to drive mRNA transcription. Conversely, an 
intragenic promoter may be close to a correctly orien- 
tated gene, as is the case for many known mRNA pro- 
moters (Keseler et al. 2013). Hence, we next determined 
the distance between each of the 668 promoters and the 
nearest properly orientated gene. These distances repre- 
sent the length of potential mRNA 5' untranslated regions 
(UTRs). Of the 5' UTRs reported by RegulonDB, 95% are 
<227 nt in length (Salgado et al. 2013). However, only 41 % 
of potential 5' UTRs in high H-NS regions are <227 nt in 
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length. This is significantly lower than in low H-NS regions, 
where 80% of mRNA 5' UTRs are <227 nt in length (Mann 
Whitney U test, normal approximation, P = 2.4 X 1CT 21 ) 
(Fig. 3F). We conclude that most promoters in regions bound 
by H-NS, including 31 % of intergenic promoters (Fig. 3G, 
dark-green sector of right pie chart), likely drive synthe- 
sis of spurious ncRNAs (i.e., ncRNAs lacking function). 
However, we do not exclude the possibility that a small 
number of such promoters drive synthesis of an anno- 
tated regulatory RNA or mRNA. The genomic context of 
promoters in high and low H-NS regions is summarized 
in Figure 3G. Interestingly, while both intragenic and 
intergenic promoters in high H-NS regions are repressed 
by H-NS, repression of intragenic promoters is signifi- 
cantly greater (Mann Whitney U test, normal approxima- 



Figure 3. H-NS suppresses hundreds of intragenic promoters ge- 
nome-wide. (A) Examples of promoters repressed by H-NS. Three 
genomic regions are shown. Data for H-NS occupancy are shown by 
the brown graph. The dashed line indicates the cutoff separating 
H-NS "high" and "low" regions, a 70 ChlP-seq data from wild-type 
cells are shown in green, and data from Alms cells are shown in 
purple. TSS scores (sequence read density for a given TSS) from wild- 
type (green) and Alms (purple) cells are also shown. The TSS score is 
shown only for TSSs identified in the Aims mutant that were 
collocated with a a 70 ChlP-seq peak. The scale for H-NS ChlP-seq 
data is the same in all three panels. The scales for a 70 ChlP-seq and 
TSS data are different in each panel but consistent within each 
panel. [B] H-NS reduces binding of a 70 at many promoters. Scatter 
plot showing ChlP-seq a 70 occupancy for wild-type and Alms cells. 
Each data point represents a a 70 ChlP-seq peak from the Aims 
mutant that was collocated with a TSS. Blue and red data points 
are for a 70 peaks in low and high H-NS regions, respectively. (C) 
H-NS reduces transcription from many promoters. Scatter plot 
showing TSS strength for wild-type and Alms cells. Each data point 
represents a TSS from the Alms mutant that was collocated with 
a a 70 ChlP-seq peak. Blue and red data points are for TSSs in low and 
high H-NS regions, respectively. (D) H-NS occludes o- 70 from pro- 
moters inside genes, especially those that are horizontally acquired 
or have been described as being repressed by H-NS. Box plot showing 
the distribution of the ratio of a 70 occupancy in wild- type/ Alms 
cells, as determined using ChlP-seq, for selected groups of promoters 
(cr 70 ChlP-seq peaks collocated with TSSs). Boxes indicate the 25th- 
75th percentile. Horizontal lines indicate the median value. The 
whiskers indicate the 5th-95th percentile. The number of contrib- 
uting promoters is indicated. Categories of promoters tested were ( 1 ) 
promoters in high H-NS regions, (2) promoters in low H-NS regions, 
(3) intragenic promoters in high H-NS regions, (4) intergenic pro- 
moters in high H-NS regions, (5) promoters in high H-NS regions 
that are <227 bp from the nearest appropriately orientated annotated 
gene start (mRNA), (6) promoters in high H-NS regions that are 
>227 bp from the nearest appropriately orientated annotated gene 
start (ncRNA), (7) promoters in horizontally acquired genes, and (8) 
promoters in genes described previously as being repressed by H-NS. 
(£) H-NS represses transcription from promoters inside genes, 
especially those that are horizontally acquired or have been de- 
scribed as being repressed by H-NS. Box plot showing the distribu- 
tion of the ratio of TSS strength in wild- type/ Alms cells (TSSs 
collocated with a 70 ChlP-seq peaks). Boxes indicate the 25th-75th 
percentile. Horizontal lines indicate the median value. The whiskers 
indicate the 5th-95th percentile. The number of contributing pro- 
moters is indicated. The categories of promoters tested match those 
for D. (F) Promoters in H-NS regions are often far from gene starts. 
Cumulative frequency distribution for the distance of TSSs (from 
ChlP-seq peak/TSS pairs) from the nearest appropriately orientated, 
annotated gene start for TSSs in low H-NS (blue line) and high H-NS 
(red line) regions. For a distance, x, on the X-axis, the value on the 
Y-axis indicates the proportion of TSSs that are <x bp from the 
nearest appropriately orientated, annotated gene start. The dashed line 
indicates 227 bp. (G) Promoters in H-NS regions are often intragenic 
and/or drive transcription of ncRNAs. Pie charts showing the distri- 
bution of TSS location relative to genes in Alms cells. 
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tion, P = 0.03 for cr 70 occupancy and P = 0.01 for 
TSS strength) (Fig. 3D,E, cf. the first three red 
data points). Thus, intragenic promoters are 
more likely to be repressed and are more strongly 
repressed by H-NS. Importantly, intragenic pro- 
moters in H-NS -bound regions are indistinguish- 
able from all other promoters with respect to 
DNA sequence (Supplemental Fig. S4). 

We next compared the position of H-NS-re- 
pressed promoters with horizontally acquired 
genes in the known H-NS regulon. Strikingly, 
intragenic promoters repressed by H-NS align with both 
horizontally acquired DNA and genes proposed to be re- 
pressed by H-NS (Fig. 3D,E, final two data points,- Supple- 
mental Table SI; Garcia-Vallve et al. 2003; Kahramanoglou 
et al. 201 1;). We note that previous studies used microarray- 
based approaches unable to distinguish canonical mRNAs 
from ncRNAs. We conclude that many previous observa- 
tions of repression by H-NS are in fact due to repression of 
promoters inside genes. 

Binding of H-NS analogs coincides with predicted 
intragenic promoters in diverse bacterial species 

MvaT and Lsr2 are functional analogs of H-NS in Pseudo- 
monas spp. and Mycobacterium spp., respectively. For both 
proteins, ChlP-chip has been used to measure binding 
across the cognate genome (Castang et al. 2008; Gordon 
et al. 2010). Fortuitously, promoters for primary cr factors do 
not vary greatly between these species and E. coli (Potvin 
et al. 2008; Sachdeva et al. 2010). Hence, we were able to use 
the same approach used for the E. coli genomes to search 
for —10 hexamers. We then compared the position of 
these putative —10 hexamers with binding of MvaT/Lsr2. 
Putative promoters were enriched approximately fivefold 
in MvaT-bound (x 2 test, P = 8.9 X 10" 9 ) and Lsr2 -bound 
(x 2 test, P = 5.5 X 10~ 4 ) regions. Furthermore, most putative 
promoters in MvaT- and Lsr2 -bound regions were located in 
genes (58% for Pseudomonas aeruginosa and 78% for 
Mycobacterium tuberculosis). Thus, MvaT and Lsr2 are 
likely to be suppressors of intragenic transcription. 

Conclusions 

H-NS has been proposed to facilitate genome evolution 
by binding horizontally acquired DNA. Previous studies 
have concluded that H-NS silences such genes solely by 
repressing canonical transcription (Fig. 4A). In contrast, 
we show that the primary function of H-NS is to silence 
spurious transcription of ncRNAs, often inside the genes 
previously described as being H-NS-repressed (Fig. 4B). 
Hence, the fitness defect of deleting hns is likely due in 
part to increased spurious transcription. We suggest that, 
by suppressing intragenic transcription, H-NS prevents 
misappropriation of cellular resources, the formation of 
dangerous nucleic acid structures, and transcription in- 
terference. We note that the two models for H-NS function 
shown in Figure 4 are indistinguishable using DNA micro- 
array-based approaches that cannot resolve canonical 
mRNAs and spurious ncRNAs. Thus, while we have 
identified a largely new set of H-NS regulated promoters, 
these promoters coincide with the existing H-NS regu- 
lon. Previous bioinformatic screens predicted that H-NS- 
bound genes contain promoters (Tutukina et al. 2007). 
Such genes are often horizontally acquired, are A/T-rich, 
and have many TpA steps common to both H-NS targets 
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Figure 4. A new model for silencing at horizontally acquired genes by H-NS. 



and promoter —10 elements (Bouffartigues et al. 2007; 
Singh et al. 2011). Hence, these genes are ideal candidates 
for recognition by H-NS or its functional analogs and for 
spurious RNAP binding. In such regions, high-affinity ca- 
nonical promoters and transcription factors likely play a key 
role, along with H-NS, in facilitating specific transcription. 

Materials and methods 

Strains and plasmids 

E. coli JCB387 Am'r Mac and MG1655 have been described (Page et al. 
1990; Keseler et al. 2013). The Ahns MG1655 derivative (Fig. 2) was 
provided by Ding Jin. Plasmid pRW50 was described by Lodge et al. (1992). 
The 7435-bp ehxCABD fragment was synthesized by DNA2.0 and con- 
tained silent mutations to remove restriction sites to facilitate cloning. The 
~62-bp ehxCABD fragments were generated with oligonucleotides (Sup- 
plemental Table S2). A small number of fragments contained more than one 
putative promoter and were thus larger (up to 80 bp). Mutations in -10 
elements were introduced with oligonucleotide derivatives (Supplemental 
Table S2). Mutations were numbered on the assumption that - 10 elements 
are 7 bp upstream of a TSS. The Ahns--kan R MG1655 derivative used for 
ChlP-seq was constructed by PI transduction of Aims : : kan R from BW25 113 
Ai2r?s::kan R (Baba et al. 2006) into MG1655. Derivatives of BW25113 
[A{araD-araB)567, MacZ4787[::rrnB-3), rph-1, A{rhaD-rhaB)568, and 
hsdR5l4] were used for TSS mapping. Unmarked BW25113 ArppH was 
constructed by removal of kan R from BW25113 ArppH::kan R using FLP 
recombinase from plasmid pCP20 (Datsenko and Wanner 2000). BW25113 
ArppH Ahns : : kan R was constructed by PI transduction of Aims : : kan R from 
BW25113 Ahns::kan R into the unmarked BW25113 ArppH. 

Promoter DNA fragments and in vivo gene expression assays 

(3-Galactosidase assays were done using the protocol of Miller (1972). All 
assay values are the mean of three independent experiments with a 
standard deviation >10% of the mean. Cells were grown aerobically at 
37°C to mid-log phase in LB medium. 

Proteins, EMSA, and in vitro transcription assays 

H-NS and RNAP were prepared as described (Grainger et al. 2008). DNA for 
EMSA was generated by PCR using the oligonucleotides listed in Supple- 
mental Table S 1 . DNA fragments were cut with Hindlll, end- labeled using 
(7- 32 P)-ATP, and used at —10 nM concentration. The in vitro transcription 
assay was done as described by Grainger et al. (2008). DNA templates were 
generated by PCR with the oligonucleotides listed in Supplemental Table 
S2. The template (—16 \xg mL" 1 ) was preincubated with purified H-NS in 
20 mM Tris (pH 7.9), 5 mM MgCl 2/ 500 \xM DTT, 50 mM KCl, 100 ng mL" 1 
BSA, 200 nM ATP, 200 |UM GTP, 200 \xM CTP, and 10 nM UTP with 5 \id 
(a- 32 P)-UTP. The reaction was started with 400 nM E. coli RNAP. Tran- 
scripts were analyzed on a denaturing polyacrylamide gel. 



Bioinformatic analysis of promoters 

We used our alignment of 554 E. coli promoters to determine selection 
criteria (Singh et al. 2011). Thus, putative —10 elements were chosen if 
they matched the sequence 5'-TAnAAT-3', 5'-TATnAT-3' or 5 ' -TATAnT-3 ' . 
The GenoList database (http://genolist.pasteur.fr) was used to interrogate 
bacterial genomes and determine the distribution of potential —10 hexam- 
ers between genes and noncoding DNA. We used the list of MvaT-bound 
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regions described by Castang et al. (2008). Lsr2-bound regions in 
M. tuberculosis H37Rv were determined by applying a cutoff of 1.5 to the 
ChlP-chip data set of Gordon et al. (2010). 

ChIP, Chip-seq, and TSS mapping 

Targeted ChIP experiments (Fig. 2) were done as described (Singh and 
Grainger 2013) using ehxCABD fragments cloned in pRW50. For ChlP-seq 
experiments, cultures were inoculated 1:100 in LB with fresh overnight 
cultures of MG1655 or MG1655 Ahns::kan R , and cells were grown at 225 
rpm at 30°C to an OD 600 of 0.5-0.6. Protocols for ChlP-seq, TSS mapping, 
and analysis of these data are described in Supplemental Appendix SI. 
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